Recurrence interval analysis of trading volumes 
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We study the statistical properties of the recurrence intervals r between successive trading volumes 
exceeding a certain threshold q. The recurrence interval analysis is carried out for the 20 liquid 
Chinese stocks covering a period from January 2000 to May 2009, and two Chinese indices from 
January 2003 to April 2009. Similar to the recurrence interval distribution of the price returns, the 
tail of the recurrence interval distribution of the trading volumes follows a power-law scaling, and 
the results are verified by the goodness-of-fit tests using the Kolmogorov-Smirnov (KS) statistic, 
the weighted KS statistic and the Cramer-von Mises criterion. The measurements of the conditional 
probability distribution and the detrended fluctuation function show that both short-term and long- 
term memory effects exist in the recurrence intervals between trading volumes. We further study 
the relationship between trading volumes and price returns based on the recurrence interval analysis 
method. It is found that large trading volumes are more likely to occur following large price returns, 
and the comovement between trading volumes and price returns is more pronounced for large trading 
volumes. 

PACS numbers: 05.45.Tp, 89.75.Da, 89.65. Gh 



I. INTRODUCTION 



Numerous studies have been conducted to better un- 
derstand the dynamic behavior of financial markets. 
Most of the studies mainly focus on the statistical proper- 
ties of the independent events occurring at discrete time 
steps. In recent years, the waiting time between consec- 
utive events has drawn much attention, for instance, the 
intertrade duration between two consecutive trades [lHj] , 
the duration time that the price or volatility keeps below 
or above its initial value [5|-0] , and the waiting time that 
the price return first exceeds a predefined level [cj-[i~3j]. 
To the best of our knowledge, the concepts raised above 
mainly concern the waiting time between normal events 
in financial markets, i.e., the measurement of the events 
themselves has not been considered. 

The study of recurrence intervals between extreme 
events in natural records, e.g. floods, temperatures and 
earthquakes (l4l - fl~7j , has become a focus of research in cli- 
matic cataclysm, and has been widely studied by many 
scientists. It offers an inspiration that the analysis of 
recurrence intervals between extreme events with large 
price fluctuations may offer an alternative way of learning 
the dynamic behavior of financial markets. Rather than 
studying the waiting time between ordinary events, the 
recurrence interval analysis is more concerned about the 
duration time between events exceeding a certain thresh- 
old level. The recurrence intervals between volatilities 
exceeding a threshold q have been carefully studied, and 
it gets to two primarily divergent opinions about the re- 
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currence interval distribution: power-law and stretched 
exponential distributions are observed respectively for a 
variety of representative financial records |18l - l29| . 

Meanwhile, the recurrence interval analysis is also car- 
ried out for stock returns [30l - l34j . Numerical simulation 
studies have shown that the long-term correlation of the 
time series has a remarkable influence on the recurrence 
interval distribution: for the linear long-term correlated 
time series the recurrence interval distribution follows a 
stretched exponential [3, EH, HH, HsJ , and for the arti- 
ficial multifractal signals which have non-linear correla- 
tion the recurrence interval distribution exhibits a power- 
law behavior whose exponent varies with the threshold 
[3TI [37j . The financial time series have been reported to 
have multifractal nature, and the power-law distribution 
is indeed observed in empirical studies [l||, [H[ . Very re- 
cently, Ren and Zhou studied the recurrence intervals t 
between large price returns using the latest data of Chi- 
nese stock markets, and also found that the tail of the 
recurrence interval distributions could be approximated 
by a power law (33 |. 

The trading volume is known as an important vari- 
able reflecting the liquidity of the financial markets, and 
therefore is considered to be of great importance for 
the measurement of market liquidity risk. There have 
been numerous studies conducted to study the relation- 
ship between the trading volumes and the price returns, 
and growing evidence shows that large price movements 
might be driven by large tra ding volumes, described well 
by the price impact function [38l443j . Some of these stud- 
ies have revealed that the trading volumes and the mag- 
nitude of the price returns have universal properties, such 
as the fat-tailed distribution and the long-term memory 
effect [1^, H(| ■ The recurrence interval analysis method 
has been presently applied to many other fields, e.g., tur- 
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bulence |44| and networks [45|, |46[. In this paper, we 
attempt to study the recurrence intervals between large 
trading volumes, and test if recurrence intervals of the 
trading volumes show power-law distribution and mem- 
ory effects similar to those of the price returns. 

The paper is organized as follows. In Section 2, we in- 
troduce the data sets analyzed and the investigated vari- 
ables. Sections 3 and 4 study the probability distribution 
and the memory effects of the recurrence intervals respec- 
tively. In Section 5, we study the relationship between 
trading volumes and price returns based on the recur- 
rence interval analysis. Section 6 gives the conclusion. 



II. DATA SETS 

We analyze the 1-min intraday data of 20 liquid 
stocks actively traded on the Shanghai Stock Exchange 
and the Shenzhen Stock Exchange from January 2000 
to May 2009, the Shanghai Stock Exchange Compos- 
ite Index (SSEC) and the Shenzhen Stock Exchange 
Composite Index (SZCI) from January 2003 to April 
2009. The data arc retrieved from a database devel- 
oped by GTA Informatio n Technology Co., Ltd, see 
http://www.gtadata.com/[ Since the sampling time is 1 
minute, the total number of data points is about 340000 
for the two Chinese indices and 500000 for individual 
stocks. These 20 stocks are actively traded stocks repre- 
sentative in a variety of industry sectors. Each stock is 
uniquely identified with a stock code which is a unique 
6-digit number. A stock with the code initiated with 60 
is traded on the Shanghai Stock Exchange, while a stock 
with the code initiated with 00 is listed on the Shenzhen 
Stock Exchange. 

The 1-min trading volumes of the Chinese stock mar- 
kets exhibit a U-shaped intraday pat tern, like many 
Western stock markets [2(3, HH, [23l. |25|. The intraday 
pattern is defined as 
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which is the average volume at a specific minute s of the 
trading day averaged over all N trading days and Vi(s) 
is the trading volume at time s of day i. The intraday 
patterns of the 1-min trading volumes for the two Chi- 
nese indices and four representative stocks are illustrated 
in Fig. [TJ The Shanghai Stock Exchange and Shenzhen 
Stock Exchange open at 9:30 a.m., and close at 15:00 
p.m., during which there exists a midday break between 
11:30 a.m. and 13:00 p.m.. Similar to many Western 
stock markets, the average volume of the Chinese stock 
markets displays sharp peaks close to the opening and 
closing times. In addition, the average volume of the Chi- 
nese stock market surges soon after midday break which 
may due to the information aggregation and the cumula- 
tive orders submitted during the midday break. Taking 
a more careful look at the curves for the two Chinese 
indices, one observes that the average volume shows a 



relatively sharp jump at around 10:30 a.m.. This phe- 
nomenon can be explained by the trading rule that for 
the stocks which behaved abnormally the Stock Exchange 
will suspend trading in relevant securities. Until the day 
the parties with disclosure obligation make relevant an- 
nouncements, the stocks involved will resume trading at 
10:30 a.m. on that day. All these factors will affect 
the measurement of the trading volumes, and will conse- 
quently lead to periodic behavior in the distribution of 
recurrence intervals between large trading volumes. 
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FIG. 1: (Color online) Interval intraday patterns of trading 
volumes for SSEC, SZCI and four representative stocks. 

To avoid the effect of this periodic oscillation, the in- 
traday pattern is removed by 
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Then the volatility is normalized by dividing its standard 
deviation 
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In this paper, we mainly focus on the statistic properties 
of the recurrence intervals between normalized trading 
volumes exceeding a threshold q. For a certain value of q, 
a series of recurrence intervals r q are obtained, therefore 
we can study the statistic properties of the recurrence 
intervals for different values of q. 



III. PROBABILITY DISTRIBUTION OF 
RECURRENCE INTERVALS BETWEEN LARGE 
TRADING VOLUMES 

A. Empirical distribution 

It is well accepted that there exists a close relationship 
between price returns and trading volumes. Recent em- 
pirical studies on the recurrence intervals between price 
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returns have shown that the tail of the recurrence interval 
distribution follows a power-law scaling. Here we study 
the probability distribution function (PDF) P q (r) of the 
recurrence intervals between trading volumes, and see if 
the power-law scaling behavior maintains. 

In Fig. El the scaled PDFs P q {T){r) are plotted as 
a function of the scaled recurrence interval t/(t) for 
various values of q = 2,3,4,5 for the two Chinese in- 
dices and four representative stocks. For small values 
of t/(t), P g (r)(r} diverges for different q values, which 
may partially due to the discreteness of recurrence inter- 
vals. However, one can also observe that for large values 
of rj (t) the curves for different q values almost collapse 
onto a single curve and show nice linear behavior in the 
double logarithmic plot. Hence, we can conclude that 
the tails of the PDFs of the recurrence intervals between 
trading volumes follow a scaling behavior as 
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(4) 



This indicates that the scaled PDF for large values of the 
scaled interval is independent of the threshold q. 
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FIG. 2: (Color online) Empirical probability distributions of 
scaled recurrence intervals for different thresholds q = 2, 3, 4, 5 
for SSEC, SZCI and four representative stocks. The solid 
curves are the fitted functions cx~ s with parameters listed in 
Table GO 



B. Determining the scaling function 

The linear behavior of the scaled PDF P 9 (r)(r) for 
large values of r/ (t) in the double logarithmic plot sug- 



gests that the scaling function may follow a power law 
as 



f(r/(r)) = f(x) = cx- 5 , x>x 
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for x larger than a lower bound x min . We aggregate 
the interval samples above x m i n for different values of 
q = 2,3,4,5, and fit them using a common power-law 
function presented in Eq. ([5]). 

Recently, an efficient method of fitting power-law dis- 
tributions based on the Kolmogorov-Smirnov (KS) statis- 
tic is proposed by Clauset, Shalizi and Newman [43]. We 
use this method with the purpose of making a relatively 
accurate estimation of the parameters in Eq. (0. The 
KS statistic is defined as 



KS= max (\F 

X>X Tn \ n 
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where F is the cumulative distribution function (CDF) of 
the empirical data and Fpl is the CDF of the power-law 
fit. To make the empirical PDF and the best power-law 
fit as similar as possible, we determine the estimate of 
a^min by minimizing the KS statistic, then we estimate 
the values of the other two parameters c and S using the 
maximum likelihood method. The estimated parameters 
Xmim S, c and the resultant KS statistic are depicted 
in Table [fl and the power-law fits are correspondingly 
illustrated in Fig. [2] For the two indices, x m i n shows 
small values close to 1.4 indicating a scaling region over 
almost three orders of magnitude with power-law expo- 
nents S = 1.74 ± 0.04. Among 20 stocks, 16 stocks 
have a; m i n < 9.0 indicating a scaling region over more 
than two orders of magnitude with power-law exponents 
S = 2.0 ±0.3. The rest of stocks (600009, 600058, 600104 
and 000063) have i m i n > 9.0, and show scaling regions 
spanning almost two orders of magnitude with exponents 
larger than 2.0. 

We have performed a power-law fit for the empirical 
interval distributions, and given a relatively accurate es- 
timation of the parameters. Furthermore, a goodness-of- 
fit test should be adopted to examine the goodness of the 
power-law fit. Based upon the KS statistic, we test the 
hypothesis that the empirical PDFs could be fitted well 
by their common power-law fit. To do t his, a bootstrap 
method is adopted following Refs. [13, [3 . One thou- 
sand synthetic data sets are randomly generated from 
the best power-law fit of the empirical PDFs. For each 
synthetic data set, a KS statistic is obtained as 
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where F s i m is the CDF of the synthetic data and -Fsim.PL 
is the CDF of the power-law fit for synthetic data. The 
KS statistic for the empirical data is defined by Eq. 
The p-value is defined as the frequency that KS S i nl > 
KS, which is regarded as the probability that the power- 
law fit is coincident with the empirical PDF. 

A variant of the KS statistic, known as the KSW 
statistic [48[, is also used to perform the goodness-of-fit 
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TABLE I: Estimated parameters for the power-law fit of em- 
pirical PDFs, and goodness-of-fit tests based on KS, KSW 
and CvM statistics for two Chinese indices and 20 stocks. 
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test. The KSW statistic for the empirical data is defined 
as 



KSW = max 



\F-F Ph \ 



(8) 



The KSW statistic with this definition is more sensitive 
on the edges of the cumulative distributions. In the boot- 
strap process, 1000 synthetic data sets are generated from 
the best power- law fit, and therefore the KSW statistic 
for the synthetic data is calculated by 
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Similar to the p-value for KS statistic, the p-value for 
the KSW statistic could be obtained by calculating the 
frequency that KSW sim > KSW. 

The goodness-of-fit tests based on the KS and KSW 
statistics are carried out for the two Chinese indices and 
20 individual stocks, and the resultant p-values are de- 
picted in Table Q] Consider the significance level of 1%, 
if the p- value is larger than 0.01, we can conclude that 
the stock passes the test, and consequently the null hy- 
pothesis that the empirical PDFs of the relevant stock 
for different q values could be fitted well by their com- 
mon power-law fit is accepted. Among the 20 stocks, 
four stocks (600060,600088,000027,000066) marked with 
* fall in the test using both KS and KSW statistics, 
and three stocks (000001,000002,000021) marked with f 
fail in the test using KS or KSW statistic. In other 
words, 13 stocks pass the test using both KS and KSW 



statistics, and 16 stocks pass the test using at least one of 
KS and KSW statistics. Based upon this fact, we may 
roughly conclude that for most of the stocks the tails of 
empirical PDFs follow scaling behavior and the scaling 
function could be approximated by a power law. Similar 
results are obtained for the two Chinese indices composed 
of individual stocks traded on relevant stock exchanges: 
SSEC passes the goodness-of-fit test using both KS and 
KSW statistics, and SZCI passes the goodness-of-fit test 
using KS statistic. 

We also use other variation of the goodness-of-fit test 
on place of the KS statistic. The Cramer-von Miscs 
(CvM) statistic is another common used statistic, which 
is defined as 
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where F and Fpl are the CDFs of the empirical data 
and its power-law fit respectively, and N is the number 
of scaled interval samples x = t/(t) (49l - [5l| . For a se- 
quence of scaled interval samples x\ , X2, ■ • ■ ,xn arranged 
in ascending order, the computational form of W 2 statis- 
tic is given by 
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In Table [H W 2 for the two Chinese indices and 20 in- 
dividual stocks is also depicted. For a stock which has 
a W 2 smaller than the critical value 0.743, it success- 
fully passes the goodness-of-fit test under the signifi- 
cance level of 1%. From the table, only four stocks 
(600060,600088,000027,000066) show W 2 larger than the 
critical value, thus fail in the CvM test. In general, the 
CvM statistic offers results very similar to those of the 
KS statistic, and for the stock which has a large p- value 
close to 1.0 the W 2 statistic of the relevant stock is sig- 
nificantly smaller than the critical value. 



IV. MEMORY EFFECTS IN RECURRENCE 
INTERVALS BETWEEN TRADING VOLUMES 

A. Conditional PDF and mean conditional 
recurrence interval 

Besides the investigation of the probability distribu- 
tion, the computation of the temporal correlation offers 
another important way of understanding the statistical 
properties of the recurrence intervals between trading 
volumes. Previous studies have detected the presence of 
memory effects in the recurrence intervals between price 
returns [3l], HU, [34| • We thus conjecture similar memory 
effects may also exist in the recurrence intervals between 
trading volumes. 

To verify our conjecture, we first calculate the condi- 
tional PDF of the recurrence intervals between trading 
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volumes. The conditional PDF computes the probabil- 
ity of finding a recurrence interval r conditioned on the 
preceding interval To . The conditional PDF is calculated 
for a bin of To to get better statistics. We rank the whole 
sequence of the recurrence intervals in an ascending or- 
der, and partition it to four bins with equal size. Fig. [3] 
plots the scaled conditional PDF P 9 (t|to)(t) as a func- 
tion of the scaled recurrence interval t/(t) with To in the 
largest and smallest bins for different g-valucs for SSEC, 
SZCI and four representative stocks. The curves for dif- 
ferent g-values with t in the largest and smallest bins 
approximately collapse on to two separate solid curves 
as illustrated in Fig. [3j For small t/(t), P 9 (t|to)(t) with 
To in the smallest bin is larger than that with To in the 
largest bin, while P 9 (t|to)(t) with To in the largest bin 
shows values larger than that with To in the smallest bin 
when t I (t) is large. This means that small To tends to be 
followed by small t and large To tends to be followed by 
large t, and therefore provides an evidence of the short- 
term memory. 
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FIG. 3: (Color online) Scaled conditional PDF P 9 (t|to)(t) of 
scaled return interval r / (r) with to in the largest 1/4 subset 
(open symbols) and the smallest 1/4 subset (filled symbols) 
for SSEC, SZCI and 4 representative stocks. The solid lines 
are guidelines for looking. 

To detect the short-term memory of the recurrence in- 
tervals, we can also calculate the mean conditional recur- 
rence interval (t|to) conditioned on the preceding inter- 
val to- In Fig. 21 the scaled mean conditional recurrence 
interval (t|to)/(t) is plotted with respect to the scaled 
preceding interval To / (t) for the two Chinese indices and 
four representative stocks. Though (t\tq) / (t) strongly 
fluctuates in the whole region of tq / (t) , it approximately 



shows a monotonic increasing tendency as the increase of 
To / (t) . This indicates that for small (large) preceding in- 
terval To the mean value of the following interval is also 
small (large), and this further confirms the short-term 
memory revealed in the conditional PDF. 
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FIG. 4: (Color online) Scaled mean conditional return interval 
(t|to)/(t) as a function of scaled return interval tq/{t) for 
SSEC, SZCI and four representative stocks. 



B. Detrended fluctuation analysis 

To further investigate the long-term memory of the re- 
currence intervals between trading volumes, we use the 
detrended fluctuation analysis (DFA) method, known as 
a common method of measuring the temporal correla- 
tion of a time series [H^-EBl ■ The DFA method computes 
the detrended fluctuation function F(l) of the time series 
within a window of I data points after removing a linear 
trend, and P(Z) is expected to follow a scaling form 



F(i) ~ r. 



(12) 



The scaling exponent a contains the information of tem- 
poral correlation between the time series: if a = 0.5 the 
time series are uncorrelated, and if 0.5 < a < 1.0 the 
time series are long-term correlated. By calculating the 
exponent a of the recurrence intervals, we can investigate 
the temporal correlation between them. 

Fig. [5] (a) and (b) plot F(l) of the recurrence intervals 
for the two Chinese indices. Different symbols represent 
F(l) for different values of threshold q. One observes that 
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for both two indices F(l) for different q values approxi- 
mately obey a scaling form, and similar scaling behaviors 
are observed for the 20 individual stocks. By calculating 
the slope of its best linear fit in the double logarithmic 
plot, we can obtain the estimate of a. In Fig. [5] (c), the 
estimate values of a for the two Chinese indices and 20 
stocks arc depicted. It is clear that the values of a for all 
the indices and stocks are significantly larger than 0.5, 
which indicates there exists a long-term memory in the 
recurrence intervals. We further assume that the long- 
term memory of the recurrence intervals may arise from 
the the long-term memory of the trading volumes. To 
verify this we also calculate F (I) of the recurrence in- 
tervals between shuffled trading volumes, in which the 
long-term memory is artificially eliminated by shuffling. 
The exponent a of the recurrence intervals between shuf- 
fled trading volumes for the two Chinese indices and 20 
stocks shows values very close to 0.5, as illustrated in 
Fig.[5](c). This may provide an evidence to substantiate 
the assumption that the long-term memory of the trading 
volumes leads to the long-term memory of the recurrence 
between them. 



V. DEPENDENCE OF TRADING VOLUMES 
ON PRICE RETURNS 

A. Correlation between recurrence intervals of 
trading volumes and their preceding price returns 

Empirical studies have shown that the trading volumes 
are highly correlated with the price returns, that large 
price returns are usually accompanied by large trading 
volumes. We attempt to address the question how the 
price returns affect the trading volumes based on the re- 
turn interval analysis. Suppose v(t) and v(t + r) are two 
consecutive trading volumes exceeding threshold q and 
r(t) is the preceding return of recurrence interval r as 
illustrated in Fig. [51 the correlation function is defined 
as 
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(13) 



It measures the correlation between the magnitude of the 
price return \r(t)\ and the volume recurrence interval r 
immediately after it. Here (■ • ■ ) is the average over time 
t. 
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FIG. 5: (Color online) Detrended fluctuation function F(l) of 
return intervals for SSEC and SZCI. The curves are vertically 
shifted for clarity. Exponents a of trading volumes (black 
circles), recurrence intervals (open symbols) and shuffled re- 
currence intervals (filled symbols) for two Chinese indices and 
20 stocks. 




FIG. 6: (Color online) Correlation function between recur- 
rence intervals of trading volumes and their preceding returns 
for SSEC (filled black circles), SZCI (filled red squares), 10 
stocks traded on Shanghai Stock Exchange (open black cir- 
cles) and 10 stocks traded on Shenzhen Stock Exchange (open 
red squares). 

Fig- El plots the correlation function C(|r|,r) for the 
two Chinese indices and 20 stocks. For large q, C(\r\,r) 
for the two Chinese indices and most of the individual 
stocks are significantly smaller than zero. The negative 
value of C(|r|,T) indicates an anti-correlation between 
the volume recurrence interval and its preceding return. 
In other words, once a large positive or negative price 
change occurs, the following recurrence interval between 
large trading volumes tends to show small values, there- 
fore large trading volumes are more likely to occur after 
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large price returns. To illustrate this, we randomly select 
a certain series of trading volumes initiated with an ex- 
tremely large return r(t = 0) > 5 for SSEC, and plot it 
in Fig. [7] The trading volume strongly fluctuates short 
after the large return for t < 120, and shows on average 
larger values than that after cooling down for a while for 
t > 120. 




FIG. 7: (Color online) Evolution of a certain series of normal- 
ized trading volumes conditioned on a large preceding return 
r > 5 for SSEC. 



B. Comovement between trading volumes and 
price returns 

We have shown that there exists an anti-correlation 
between the recurrence intervals of trading volumes and 
their preceding price returns. The evolution curves of 
trading volumes and absolute price returns are plotted 
in Fig. [51 the observed comovement between them sug- 
gests that there might be some relationship between their 
recurrence intervals. Therefore, we further study the re- 
lationship between the recurrences intervals of both trad- 
ing volumes and price returns. In doing so we calculate 
the probability that the trading volumes have the same 
recurrence intervals as the price returns as illustrated in 
Fig. [5] Suppose two consecutive trading volumes exceed- 
ing threshold q occur at time t and t + r, the probability 
is calculated as 

N t, \r\>Q 



P(t I \r\ >Q) = 



(14) 



where N Tt i r i>g is the number of r conditioned on two 
consecutive absolute returns exceeding the threshold Q, 
i.e., \r(t)\ > Q and \r(t + r) > Q, and N T is the total 
number of r. 

In Fig. IH1 P{t I \r\ > Q) is plotted as a function of the 
threshold Q for the two Chinese indices and four repre- 
sentative stocks. In principle, P(t \ \r\ > Q) equals one 




FIG. 8: (Color online) Illustration of normalized trading vol- 
umes and absolute price returns which have same recurrence 
intervals for SSEC. 



when Q = 0, and approximately follows a power-law de- 
cay for large scales of Q as shown in the insets in Fig. [9] 
One observes that P(r | \r\ > Q) for large q is obviously 
larger than that for small q, displaying a power-law tail 
with an exponent similar to that for small q. This indi- 
cates that the comovement between trading volumes and 
price returns is more pronounced for large thresholds q 
and Q, which further confirms the close relationship be- 
tween the trading volumes and the price returns. 



VI. CONCLUSION 

In this paper, we have introduced the recurrence in- 
terval analysis to the study of stock market trading vol- 
umes. The recurrence intervals between the trading vol- 
umes above a threshold q for the two Chinese indices and 
20 liquid Chinese stocks are carefully studied. Though 
the scaled PDFs of the recurrence intervals for different 
q values diverge slightly for small intervals, they tend to 
show scaling behavior for large intervals. The goodness 
of fit tests are further performed to find out the specific 
form of the scaling function, and the results demonstrate 
that it could be approximated by a power law. The mea- 
surements of the conditional PDF, the mean conditional 
recurrence interval and the DFA method are carried out 
to detect the short-term and long-term memory of the 
recurrence intervals, and both memory effects are clearly 
observed. 

The study of market impact of trading volumes is sup- 
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a q 



FIG. 9: (Color online) Probability of volume intervals condi- 
tioned on large volatilities |r| > Q for SSEC, SZCI and four 
representative stocks. 

posed to be of great importance for both theoretical and 
practical reasons. Here, we tried to study the correla- 



tion between trading volumes and price returns using the 
recurrence interval analysis method. We calculated the 
correlation between the recurrence intervals of trading 
volumes and their proceeding price returns, and found 
that the trading volumes following large price returns are 
more likely to show large values, which may corresponds 
to the unstable state when after a market burst. The pro- 
nounced comovcmcnt between large trading volumes and 
large price returns further confirms our previous findings 
that large price returns are usually accompanied by large 
trading volumes. In fact, our study mainly focuses on the 
relationship between trading volumes and the magnitude 
of price returns. Further study concerning the depen- 
dence of trading volumes on price returns taking into 
account the effect of the signs of price returns needs to 
be proceeded, and this will certainly help to study the 
risk estimation in financial markets. 
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